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ABSTRACT 

We use the halo occupation model to calibrate galaxy group finders in magnitude- 
limited rcdshift surveys. Since, according to the current scenario of structure forma- 
tion, galaxy groups are associated with cold dark matter halos, we make use of the 
properties of the halo population in the design of our group finder. The method starts 
with an assumed mass-to-light ratio to assign a tentative mass to each group. This mass 
is used to estimate the size and velocity dispersion of the underlying halo that hosts 
the group, which in turn is used to determine group membership (in redshift space). 
This procedure is repeated until no further changes occur in group memberships. We 
find that the final groups selected this way are insensitive to the mass-to-light ratio 
assumed. We use mock catalogues, constructed using the conditional luminosity func- 
tion (CLF), to test the performance of our group finder in terms of completeness of 
true members and contamination by interlopers. Our group finder is more successful 
than the conventional Friends-of- Friends group finder in assigning galaxies in common 
dark matter halos to a single group. We apply our group finder to the 2-degree Field 
Galaxy Redshift Survey and compare the resulting group properties with model pre- 
dictions based on the CLF. For the ACDM 'concordance' cosmology we find a clear 
discrepancy between the model and data in the sense that the model predicts too many 
rich groups. In order to match the observational results we have to either increase the 
mass-to-light ratios of rich clusters to a level significantly higher than current obser- 
vational estimates, or to assume erg — 0.7, compared to the 'concordance' value of 
0.9. 

Key words: dark matter - large-scale structure of the universe - galaxies: halos - 
methods: statistical 



1 INTRODUCTION 

It is common practice to apply a group finder to large galaxy 
redshift surveys in order to assign galaxies to groups^. The 
clustering properties of galaxies can then be studied by 
analysing the spatial clustering of the groups and the distri- 
bution functions of the groups with respect to their inter- 
nal properties, such as luminosity, velocity dispersion, mass, 
shape, and galaxy population. Such analyses have been car- 
ried out using various galaxy redshift surveys, most notice- 
ably the CfA redshift survey (e.g. Geller & Huchra 1983), 
the Las Campanas Redshift Survey (e.g. Tucker et al. 2000) , 
the 2-degree Field Galaxy Redshift Survey (hereafter 2dF- 

* E-mail: xhyang@astro.umass.edu 

t In this paper, we refer to a system of galaxies as a group re- 
gardless of its richness, including isolated galaxies (i.e., groups 
with a single member). 



GRS; Merchan & Zandivarez 2002; Eke et al. 2004a, 2004b), 
and the Sloan Digital Sky Survey (hereafter SDSS; Bahcall 
et al. 2003; Lee et al. 2004). 

One of the problems in such analyses is that the prop- 
erties of the groups depend on the group finder used to 
identify groups, and it is in general quite hard to judge 
whether the groups identified are true associations of galax- 
ies in space. There are several reasons for such complication. 
First of all, galaxies are observed in redshift-space, not in the 
real space, and so every group finder has to contend with 
the redshift distortion in the apparent clustering patterns 
of galaxies. Secondly, redshift surveys are usually apparent 
magnitude-limited, and so any criteria for clustering based 
on the distances between galaxies has to take into account 
the variation of the mean inter-galaxy separation with dis- 
tance. Thirdly, since only a finite number of galaxies can be 
observed for each group, shot noise can also affect the accu- 
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racy of membership assignment, especially for small groups 
that contain only a small number of observable galaxies. 

One may argue that this is less a real problem than a 
matter of definition for galaxy groups. However, since the ul- 
termate goal of such analyses is to compare with theory, the 
definition must have a sound physical basis, in order to make 
a meaningful comparison between observation and theory. 
From the point of view of current theory of galaxy forma- 
tion, the most natural reference for defining galaxy groups 
is dark matter halos. These are quasi-equilibrium systems 
of dark matter particles, formed through non-linear gravi- 
tational collapse. In the standard ACDM model favored by 
current observations, most mass at any given time is bound 
within dark halos; galaxies and other luminous objects are 
assumed to form by cooling and condensation of the baryons 
within halos. Thus, it is extremely useful to have a group 
finder that can group galaxies according to their common 
dark halos. Unfortunately, dark halos are not directly ob- 
servable, and so it is not possible to define groups directly 
from dark matter distribution. On the other hand, since the 
current CDM scenario is very successful in explaining a very 
large range of observational data (e.g. Spergel et al. 2003), 
we may use the properties of dark halos in current CDM 
models as a guide in our design of a group finder. Much 
progress has been made in recent years in understanding the 
relationship between CDM halos and galaxies, based on nu- 
merical simulations (e.g., Katz, Weinberg & Hernquist 1996; 
Pearce et al. 2000) and semi-analytic models (e.g.,Somerville 
& Primack 1999; Kauffmann et al. 1999; Cole et al. 2000). 
However, our understanding for the details about galaxy for- 
mation in CDM halos is still quite poor, and so calibrations 
of group finders have not been carried out in a reliable, 
model-independent way. Furthermore, to produce a mock 
catalogue that can not only cover a volume as large as cur- 
rent large redshift surveys of galaxies, such as 2dFGRS and 
SDSS, but also have the ability to resolve faint galaxies, is 
not trivial for both numerical simulation and semi-analytical 
modelling. What one really needs is an empirical model that 
can correctly partition the galaxy population into dark ha- 
los but does not depend on uncertain details about galaxy 
formation in dark halos. 

The halo occupation model recently developed has this 
spirit. In this model, one simply specifies halo occupation 
numbers, (N(M)), which describe how many galaxies on av- 
erage occupy a halo of mass M. Many recent investigations 
have used such halo occupation models to study various as- 
pects of galaxy clustering (Jing, Mo & Borner 1998; Pea- 
cock & Smith 2000; Seljak 2000; Scoccimarro et al. 2001; 
White 2001; Jing, Borner & Suto 2002; Bullock, Wech- 
sler & Somerville 2002; Berlind & Weinberg 2002; Scranton 
2002; Kang et al. 2002; Marinoni & Hudson 2002; Zheng 
et al. 2002; Magliocchetti & Porciani 2003; Kochanek et 
al. 2003; Yan, Madgwick& White 2003; Yan, White & Coil 
2004). In two recent papers, Yang, Mo &: van den Bosch 
(2003; hereafter Paper I) and van den Bosch, Yang & Mo 
(2003; hereafter Paper II) have taken this halo occupation 
approach one step further by considering the occupation 
as a function of galaxy luminosity and type. They intro- 
duced the conditional luminosity function (hereafter CLF) 
$(L\M)dL, which gives the number of galaxies with lumi- 
nosities in the range L±dL/2 that reside in halos of mass M. 
The advantage of this CLF over the halo occupation func- 



tion (N(M)) is that it allows one to address the clustering 
properties of galaxies as function of luminosity. In particu- 
lar, such model can be used to pupulate dark matter halos 
in high-resolution A-body simulations to construct realistic 
mock galaxy redshift surveys that automatically have the 
correct galaxy abundances and correlation lengths as func- 
tion of galaxy luminosity and type. As shown in Yang et al. 
(2004), the mock galaxy catalogues constructed in this way 
can recover many of the properties of galaxy clustering in 
redshift space. Further analyses with such mock catalogues 
show that the model is also successful in matching the envi- 
ronmental dependence of galaxy luminosity function (Mo et 
al. 2004), the kinematics of satellite galaxies around galaxies 
(van den Bosch et al. 2004ab) and the three-point correla- 
tion function (Wang et al. 2004). 

The aim of this paper is two-fold. First, we intend to 
develop a group finder that is based on the consideration 
given above, and can group galaxies according to their com- 
mon halos. We test the reliability of the method with the 
extensive use of various mock samples constructed from the 
CLF model. Secondly, we apply the group finder to the 2dF- 
GRS, and compare the properties of the groups so obtained 
with model predictions. The outline of the paper is as fol- 
lows. In Section 2 we describe our halo-based group finder, 
which we test in Section 3 with the extensive use of detailed 
mock samples constructed from the CLF model. In Section 4 
we apply our group finder to the 2dFGRS, and compare the 
properties of the groups thus obtained with model predic- 
tions. We summarise our results in Section 5. 

Unless stated otherwise, we consider a flat ACDM 
cosmology with Q m = 0.3, S1a = 0.7 and h — 
#o/(100 kms -1 Mpc -1 ) = 0.7 and with initial density fluc- 
tuations described by a scale-invariant power spectrum with 
normalisation as = 0.9. These cosmological parameters are 
in good agreement with a wide range of observations, includ- 
ing the recent WMAP results (Spergel et al. 2003), and in 
what follows we refer to it as the "concordance" cosmology. 



2 THE GROUP FINDER 

Our aim here is to develop a group finder that assigns galax- 
ies in a common halo to a single group. The properties of 
the halo population in the standard ACDM model are well 
understood, largely due to a combination of A-body simula- 
tions and analytical models. Dark matter halos are defined 
as virialized structures with a mean over-density of about 
180. Their density profiles are well described by the so-called 
NFW profile (Navarro, Frenk & White 1997): 

PW = (r/r s )(l + r/r s ) 2 ' W 

where r s is a characteristic radius, p is the average density of 
the Universe, and 5 is a dimensionless amplitude which can 
be expressed in terms of the halo concentration parameter 

c = ri8o/r s as 

i = 180 c 3 

3 ln(l + c) - c/(l + c) ' 1 ' 

Here riso is the radius within which the halo has an average 
over-density of 180. Numerical simulations show that the 
halo concentration depends on halo mass, we use the relation 
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given by Bullock et al. (2001) and properly rescaled to our 
definition. 

Both observations and numerical simulations suggest 
that the brightest galaxy in each dark matter halo resides 
at rest at the centre, while the number density distribution 
of the fainter, satellite galaxies matches that of the dark 
matter particles (e.g., Carlberg et al. 1997; van der Marel 
et al. 2000; Berlind et al. 2003; Lin, Mohr & Stanford 2004; 
Rines et al. 2004; Diemand et al. 2004). Furthermore, al- 
though velocity bias may exist in the sense that galaxies 
move with different velocities as the dark matter particles at 
the same location, such bias is found to be not large (Berlind 
et al. 2003, Yoshikawa et al. 2003). These findings suggest 
that it may be possible to use information regarding the 
spatial distribution and velocity dispersion of galaxy group 
members to estimate halo masses, and vice versa. Motivated 
by these considerations, we design a group finder that con- 
sists of the following steps: 

Step 1: We combine two different methods to identify 
the centres of potential groups. First we use the traditional 
Friends-Of-Friends (FOF) algorithm to assign galaxies into 
groups. Since we are working in redshift space, we sepa- 
rately define linking lengths along the line of sight (£ z ) and 
in the transverse direction ((. p ). Since the purpose here is 
only to identify the group centres, we use relatively small 
linking lengths: £ z = 0.3 and i v = 0.05, both in units of 
the mean separation of galaxies. Note that for an apparent 
magnitude limited survey the mean separation of galaxies is 
a function of redshift, which we take into account. The geo- 
metrical centres of all FOF groups thus identified with more 
than 2 galaxies are considered as centres of potential groups. 
Next, from all galaxies not yet linked together by these FOF 
groups, we select bright, relatively isolated galaxies which we 
also associate with the centres of potential groups. Following 
McKay et al. (2002), Prada et al. (2003), Brainerd & Spe- 
cian (2003) and van den Bosch et al. (2004a), we identify 
a galaxy as 'central', and thus as the centre of a potential 
group, when it is the brightest galaxy in a cylinder of radius 
1 ft _1 Mpc and velocity depth ±500 kms -1 . 

Step 2: We estimate the luminosity of a selected po- 
tential group using 

u 



= E 



(3) 



where L; is the luminosity of each galaxy in the group, and 
f c is the luminosity-dependent incompleteness of the survey 
(which is relevant only when the group finder is applied to 
real data, such as the 2dFGRS). The total luminosity of the 
group is approximated by 



itotal — La 



J™L<j>(L)dL 
' L<t>{L)dL 



(4) 



where Ln m is the minimum luminosity of a galaxy that can 
be observed at the redshift of the group, and 4>(L) is the 
galaxy luminosity function 

*(L)dL = *. ) d ± . (5) 

Throughout this paper, galaxy luminosities are defined in 
the photometric bj band. For the luminosity function, wc 
take the parameters from Norberg et al. (2002): (4>,,M,,a) 
= (0.0161, -19.66, -1.21). 



Step 3: From L to tai and a model for the group mass-to- 
light ratio (see below), we compute an estimate of the halo 
mass associated with the group in consideration. From this 
estimate we also compute the halo radius riso, the virial 
radius r v i r ^, and the virial velocity Vwr = (GMjr^) 1 ^ . 
The line-of-sight velocity dispersion of the galaxies within 
the dark matter halo is assumed to be a — V v i r /\/2. 

Step 4: Once we have a group centre, and a tentative 
estimate of the group size, mass, and velocity dispersion, we 
can assign galaxies to this group according to the properties 
of the associated halos. If we assume that the phase-space 
distribution of galaxies follows that of the dark matter parti- 
cles, the number density contrast of galaxies in redshift space 
around the group centre (= centre of dark matter halo) at 
redshift % roup can be written as 

P M (R,Az) = ^^-p(Az), 



(6) 



Here Az — z — z gIoup and S(i?) is the projected surface 
density of a (spherical) NFW halo: 



£(i?) = 2 r s 8 p f(R/r c ) , 
with 



/(*) = { 




l / ^ atan^/ x 2 — 1 \ 



if x < 1 

if x = 1 
if x > 1 



(7) 



(8) 



The function p(Az)dAz describes the redshift distribution 
of galaxies within the halo for which we adopt a Gaussian 
form 



p(Az) 



2na(l + Zgroup) 



exp 



-(cAz 



2a 2 (l + z R 



(9) 



where a is the rest-frame velocity dispersion. 

Thus defined, Pm(R, Az) is the three-dimensional den- 
sity contrast in redshift space. In order to decide whether 
a galaxy should be assigned to a particular group we pro- 
ceed as follows. For each galaxy we loop over all groups, and 
compute the corresponding distance (R, Az) between galaxy 
and group centre. Here R is the projected distance at the 
redshift of the group. If Pm(R,Az) > B, with B an ap- 
propriately chosen background level, the galaxy is assigned 
to the group. If a galaxy can be assigned to more than one 
group, it is only assigned to the group for which Pm{R, Az) 
has the highest value. Finally, if all members of two groups 
can be assigned to one group according to the above crite- 
rion, the two groups are merged into a single group. The 
background level B defines a threshold density contrast in 
redshift space whose value we calibrate using mock galaxy 
catalogues (see Section 3.2). 

Step 5: Using the group members thus selected we re- 
compute the group-centre and go back to Step 2, iterat- 
ing until there is no further change in the memberships of 
groups. Note that, unlike with the traditional FOF method, 



t The virial radius is defined as the radius inside of which the 
average density is A v ; r times the critical density, with A v ; r given 
by Bryan & Norman (1998) 
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this group finder also identifies groups with only one mem- 
ber. 

The basic idea of this group finder is similar to that 
of the matched filter algorithms developed by Postman et 
al. (f996) (see also Kepner et al. f999; White & Kochanek 
2002; Kim et al. 2002; Kochanek et al. 2003), although we 
also make use of the galaxy kinematics in our analyses. 
The crucial ingredient of our method is to somehow ob- 
tain a reliable estimate of the halo mass associated with the 
galaxy group from observable properties of its selected mem- 
ber galaxies. Ideally, this mass estimate should be model- 
independent. For example, one might hope to estimate the 
mass from the velocity dispersion of the selected members, 
using the fact that M oc a . Unfortunately, we found that 
this is not a reliable method, simply because the estimate of 
a is too noisy when only few group members are available. 
Therefore, we decided to use a model for the mass-to-light 
ratio to estimate the group mass from the total luminosity 
of the selected galaxies. Since total group luminosity is dom- 
inated by the few brightest galaxies, the estimated mass is 
much less sensitive to the absence of faint group members. A 
downside of this method, however, is that it requires a model 
for the mass-to-light ratios. Fortunately, we found that the 
memberships of the selected groups are remarkably insensi- 
tive to the adopted model. The reason for this is that, even 
if the estimated mass is wrong by a factor of 10, the implied 
radius and velocity dispersion, used in the membership de- 
termination, change only by a factor of 2.15 

In what follows we use the average mass-to-light ratios 
obtained by van den Bosch, Yang & Mo (2003) from the 
CLF formalism 



M 
T 



— ( —\ ( M \ " 71 i { M V 2 

2 \ L ) \ \MiJ "•" \ M 1 ) 

(M/L) cl 



if M14 < 1 
if M14 > 1 



(10) 



where we set the model parameters to those of their 
model D: Mi = lO 10 ' 94 /)," 1 M , 71 = 2.02, 72 = 0.30, 
(M/L) = 124ft (M/L) Q , and (M/L) cl = 500ft (M/L) . M 14 
is the halo mass in units of lO 14 ft _1 M . As we demonstrate 
in the next section, even if we adopt a constant mass-to- 
light ratio in Step 3 of M/L = 400ft (M/L) Q for all halos, 
the selected groups are virtually identical to those selected 
when using these CLF-based mass-to- light ratios. 



3 TESTING THE GROUP FINDER 
3.1 Constructing Mock Samples 

We construct mock galaxy samples by populating dark mat- 
ter halos in numerical simulations with galaxies of different 
luminosities, using the conditional luminosity function for- 
malism developed in Papers I and II. In what follows we 
focus on CLF model D defined in paper II, which is valid 
for the concordance cosmology considered here, and which 
yields excellent fits to the observed LFs and the observed 
correlation lengths as function of both luminosity and type. 
The same CLF has also been used in Yang et al. (2004) to 
construct large, mock galaxy redshift surveys, which they 
used to compare various large scale structure statistics with 
those obtained from the 2dFGRS. 

In this paper we use the same mock samples to test our 
group finder and to construct detailed mock galaxy redshift 



surveys for comparison with the 2dFGRS. Here we briefly 
summarise the ingredients, but refer the reader to Yang et 
al. (2004) for details. The mock samples are constructed us- 
ing a set of iV-body simulations carried out by Y.P. Jing and 
Y. Suto (see Jing 2002; Jing & Suto 2002) on the VPP5000 
Fujitsu supercomputer of the National Astronomical Ob- 
servatory of Japan using a vectorized-parallel P 3 M code. 
The set consists of a total of six simulations, each of which 
uses N = 512 3 particles to evolve the distribution of dark 
matter from an initial redshift of z — 72 down to z — 
in a ACDM 'concordance' cosmology. All simulations con- 
sider boxes with periodic boundary conditions; in two cases 
Lbox = 100ft -1 Mpc while the other four simulations all have 
Lbox = 300ft" 1 Mpc. Different simulations with the same box 
size are completely independent realizations and are used to 
estimate uncertainties due to cosmic variance. The particle 
masses are 6.2 x 10 8 ft _1 M and 1.7 x 10 10 ft _1 M for the 
small and large box simulations, respectively. Dark matter 
halos are identified using the FOF algorithm with a linking 
length of 0.2 times the mean inter-particle separation. For 
each individual simulation we construct a catalogue of halos 
with 10 particles or more, for which we store the mass, the 
position of the most bound particle, and the halo's mean ve- 
locity and velocity dispersion. Halos that are unbound are 
removed from the sample. In Yang et al. (2004) we have 
shown that the resulting halo mass functions are in excel- 
lent agreement with the analytical halo mass function given 
by Sheth & Tormen (1999) and Sheth, Mo & Tormen (2001). 
The mock galaxy distributions are constructed by populat- 
ing the dark matter halos in these iV-body simulations with 
galaxies according to the CLF. The brightest galaxy in each 
halo is located at rest at the centre of the halo, while the 
other galaxies follow a number density distribution that is 
identical to the mass distribution of the dark matter and 
with an isotropic velocity dispersion that is identical to that 
of the dark matter. Finally, when using these mock sam- 
ples to create a mock 2dFGRS (hereafter MGRS), we stack 
simulations with different box sizes and resolutions in order 
to cover a large redshift range (to match the depth of the 
2dFGRS) and to properly sample the faint galaxies. Obser- 
vational errors and selection effects in the 2dFGRS, such as 
position-dependent completeness and magnitude-limit vari- 
ations are all taken into account. From our set of six simu- 
lation boxes we construct eight independent MGRSs, which 
we use to estimate the impact of cosmic variance. 



3.2 Volume limited samples 

Before applying our group finder to the MGRSs and the 
2dFGRS, we first test and calibrate it using a volume lim- 
ited sample of mock galaxies in one of our simulation boxes 
with L box = 100ft" 1 Mpc. To mimic observation, we simply 
assume redshifts to be along one side of the simulation box. 

In what follows a true group is defined as the set of 
galaxies that reside in the same halo. In order to quantify 
the group finder performance we introduce the complete- 
ness, F t , defined as the ratio between the number of true 
members selected by the group finder and the total number 
of true group members, and the contamination, F, defined 
as the ratio between the number of false members (inter- 
lopers) selected by the group finder and the total number of 
true members. A perfect group finder has F t = 1 and Fi = 0. 
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Figure 1. The value of \P = Ft x (1 — F) (where ft is the completeness factor and fl is the contamination factor) as a function of halo 
mass for different choices of the background level B. The circles show the result for our fiducial group finder with B = 10. The solid 
and dashed lines correspond to B = 5 and 20, respectively, while the dotted and dot-dashed lines outline two extreme cases with B = 
and 100, respectively. The four different panels show results using different luminosity cuts: only galaxies brighter than the absolute 
magnitude limit indicated (where M£ = M^j — 51ogh) are used to identify groups and to define Ft and Fi. 



Therefore, we optimise our group finder by simultaneously 
maximising Ft and minimising Fi. Note that these two cri- 
teria may sometimes be in conflict. For example, a certain 
group finder may select all true members but at the expense 
of including many interlopers. Other group finders may yield 
zero interlopers but miss a large fraction of true members. 
Whether one gives more weight to large F t or small Fi de- 
pends on the question one wishes to address. In this paper 
we give equal weight to both criteria, and we tune our pa- 
rameter in the group finder, i.e. the background level B, so 



that the value of $ = Ft x (1 — Fi) is maximised. The back- 
ground level B defined in this way is about 10 for M* halos, 
and ^ has a weak dependence on the mass of the system. 

The performance of our group finder is not very sensi- 
tive to the exact value of B used. This is evident from the 
various curves in Fig. 1, which show that similar values of 
^ are obtained using a fairly large range of background lev- 
els. To put the absolute values of B in perspective we recall 
that it is used as a threshold for the redshift-space density 
contrast of groups. Ideally, B should therefore correspond 
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Figure 2. Red and blue points indicate the completeness and contamination, respectively, of individual groups as function of halo mass. 
Solid dots and open triangles indicate the corresponding averages with the errorbars indicating the l-cr variance. Results correspond to 
our fiducial group finder with B = 10. Different panels correspond to different absolute magnitude limits as in Fig. 1. 



roughly to the redshift-space density contrast at the edge of 
a halo, i.e., 

B ^ Prcdfriso) ^ ^(riso) (47r/3)r 180 
~ P ~ P ^ 80 a/H 

Using that p(r\%o)/p ~ 30 and cr/iYo^iso ~ 4, we obtain 
B ~ 10, in excellent agreement with the value obtained by 
maximising ^ . Therefore, in what follows we adopt B — 10, 
independent of halo mass. 

Fig. 2 gives a more detailed overview of the complete- 
ness and contamination for our fiducial group finder with 
B — 10. Dots correspond to individual groups (halos), while 



the big symbols with errorbars indicate the average Ft (solid 
circles) and Fj (open triangles). Only galaxies brighter than 
a certain absolute-magnitude limit (indicated in the upper 
right corner of each panel) are used to identify groups. The 
incompleteness and contamination are also defined with re- 
spect to all group members brighter than that limit. Overall 
the group finder is quite successful, with an average com- 
pleteness of about 90% and a contamination of about 20% 
for groups in dark matter halos with masses spanning the 
entire range from ~ 10 12 /t -1 M Q to ~ 10 15 /i _1 M Q . Note also 
that the completeness and contamination do not depend sig- 
nificantly on the luminosity limit of the tracer galaxies. This 
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Figure 3. Same as in Fig. 2, but here wc compare different group finder methods. The thick (thin) lines and symbols show the 
completeness (contamination) of the groups. The symbols with errorbars are the same results as in Fig. 2. The solid lines show the 
results in which wc assume a constant mass-to-light ratio M/L = 400(/iMq/Lq). The dot-dashed lines show the results where we use 
the traditional FOF group finder method with the same linking lengths as in Eke et al. (2004a). 



implies that our group finder can make a fairly uniform iden- 
tification of galaxy groups even in an apparent-magnitude 
limited sample. As we shall see in Section 3.3, this is indeed 
the case. 

In the group finder used thus far we have used the 
mass-to- light ratio predicted by the CLF (eq. [10]) to es- 
timate the sizes and velocity dispersions of the groups. In 
order to test the sensitivity of the group finder to this model 
assumption, we performed a number of tests with differ- 
ent mass-to-light ratios. One of the more extreme exam- 
ples tested is a model with constant mass-to-light ratio of 



M/L = 400/i (M/L)0 independent of halo mass. As shown 
in Fig. 3 the resulting completeness and contamination are 
very similar to those obtained using our fiducial mass-to- 
light ratios of eq. (10). We obtained similar results for all 
mass-to- light ratios tested, indicating that the completeness 
and contamination levels of our group finder are extremely 
insensitive to the exact mass-to-light ratios assumed. 

Fig. 3 also shows the results for groups identified using 
the standard friends-of-friends (FOF) algorithm (i.e., only 
using Step 1 in Section 2). We use exactly the same linking 
lengths as in Eke et al. (2004a): 
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Figure 4. Comparison of the group luminosity L groU p (the total luminosity of all identified group members) with the true group 
luminosity -Ltruo (the total luminosity of all true group members). Only results for groups with more than 2 members are shown. The 
four different panels correspond to different absolute magnitude cuts (as indicated), where M' b = M[, ; — 51og/i. 



l v = 0.13 (A/5) ' 04 n~ 1/3 (12) 
in the transverse direction, and 

£ z = 1.43 (A/5) ' 16 n- 1/a (13) 

along the line of sight, with n the number density of galax- 
ies. The quantity A is the galaxy density contrast relative to 
the background at the redshift considered in a cylinder of ra- 
dius 1.5ft _1 Mpc and velocity depth ±1650kms _1 . Although 
the completeness given by the FOF algorithm is similar to 
our group finder, the contamination is significantly higher. 
In addition the contamination in poor groups increases sys- 



tematically when only using brighter galaxies. Therefore, the 
FOF method does not give a uniform identification in an ap- 
parent magnitude limited sample (e.g. Diaferio et al. 1999). 

To further test the performance of our group finder, 
we compare properties of the identified groups with those of 
the true groups. Fig. 4 shows the predicted group luminosity 
£grou P (the sum of the luminosities of all members assigned 
by the group finder) versus the true group luminosity itruc 
(the sum of the luminosities of all true members). We only 
show results for groups with three or more selected mem- 
bers. Each of the four panels shows the result where galaxies 
brighter than a certain absolute magnitude limit are used in 



© 0000 RAS, MNRAS 000, 000-000 



A Halo-Based Galaxy Group Finder 9 




800 




w 

B 



800 



600 - 



400 - 



a 

3 

i P 



200 



200 400 600 
<7{° r s ue (km/s) 



800 




200 400 600 
a£ s ue (km/s) 



800 



Figure 5. Same as Fig. 4, except that here we compare the line-of-sight velocity dispersion of identified group members with the 
line-of-sight velocity dispersion of true members. All velocity dispersions are estimated using the gappcr estimator described in the text. 



selecting the groups and in calculating the group luminosi- 
ties. Overall the correlation is fairly tight, indicating that 
the incompleteness and the presence of interlopers does not 
strongly impact on total group luminosity. 

For a system in dynamical equilibrium, its mass may 
be estimated from its velocity dispersion through the virial 
theorem. Since the velocity dispersion of a group can be 
measured from the redshifts of member galaxies, it is in- 
teresting to examine how the velocity dispersion among se- 
lected members compares to that of the true members. We 
use the gapper estimator described by Beers, Flynn & Geb- 
hardt (1990) to estimate the group velocity dispersions. The 
method involves ordering the set of recession velocities {vi} 
of the N group members and defining gaps as 



gi = v i+1 -Vi, i = 1,2, AT — 1. (14) 
The rest-frame velocity dispersion is then estimated by 

N-l 

agap= (i+, grouP )V-i)^ Wi51 - (15) 

i — 1 

where the weight is defined as u>; = i(N — i). Since there is a 
central galaxy in each group (which is true in our mock sam- 
ples, and which we also assume to hold in the real Universe), 
the estimat ed velocity dispersion has to be corrected by a 
factor of y/N/{N - 1). In addition, both in the 2dFGRS 
and in our MGRSs there is a redshift measurement error of 
forr = 85kms _1 , which needs to be removed in quadrature. 
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Figure 6. Group luminosity functions. Histograms indicate the true luminosity functions, in which the luminosity of each group is the 
sum of the luminosities of all true group members brighter than the luminosity cut indicated in each panel. The solid lines correspond to 
the luminosity functions for all the groups identified with our fiducial group finder, while the thin solid lines indicate the corresponding 
luminosity functions obtained when only using groups with at least three identified members. The dashed lines, which almost overlap with 
the thick solid lines, indicate the group luminosity functions obtained when using a constant mass-to-light ratio of M/L = 4OO(fe(M/L)0) 
in the group finder. 



Thus the final velocity dispersion of group members is given 
by 



a= y max ^0,ip^-a c 2 rr J (16) 

(Eke et al. 2004a). Fig. 5 shows the velocity dispersions of 
groups obtained from the identified galaxies, cr g °o Up , versus 
those obtained from the true members, a\°u C - As in Fig. 4, 



only groups with three of more members are shown. Con- 
trary to the luminosities, the scatter here is quite large, and 
there is a clear trend that <7g°o Up < o-{°J c . This discrepancy 
is due to the fact that galaxies with the highest peculiar ve- 
locities in a group are the most likely to be missed by the 
group finder due to the incompleteness. We can reduce the 
fraction of missed members by lowering the background level 
B, but at the expense of a significant increase in the fraction 
of interlopers. This indicates that it is in general quite dif- 



© 0000 RAS, MNRAS 000, 000-000 



A Halo-Based Galaxy Group Finder 11 



ficult to accurately estimate the depth of the gravitational 
potential well of a dark matter halo from the velocities of 
its member galaxies identified in redshift space. 

The above tests are based on the properties of se- 
lected groups, but they do not address whether or not the 
group finder can find all the existing groups. To check the 
completeness of the group identification, we estimate the 
group luminosity function, which gives the number density 
of groups as a function of group luminosity. Fig. 6 shows the 
group luminosity functions estimated from selected groups 
with at least 1 (thick lines) or at least 3 (thin lines) mem- 
bers. The histogram indicates the luminosity function of all 
true groups (i.e., of all dark matter halos), and is shown for 
comparison. If we include all systems selected by our groups 
finder (single galaxies, binary systems, and groups with more 
than 2 members), the group catalogue is roughly complete 
down to a luminosity of about lO lo,5 /i~ 2 L0 (corresponding 
to a typical halo mass of ~ 1O 12 - 5 /i _1 M ). If, on the other 
hand, we exclude single galaxies and binary systems, the 
sample is only more or less complete down to a group lumi- 
nosity of about 1O 1O ' 8 /!~ 2 L . Finally, we tested the impact 
of the assumed mass-to-light ratio on the completeness of 
the group finder by using a constant M/L = 400/iMq / 'Lq. 
This leads to a group luminosity function that differs only 
very slightly from the fiducial case (dashed lines Fig. 6), in- 
dicating once again that our group finder is insensitive to 
the assumptions regarding the mass-to-light ratios. 

3.3 Flux limited samples 

Having tested our group finder on volume limited mock sam- 
ples, we now turn to more realistic, flux-limited samples. To 
that extent we use our mock 2dFGRS (hereafter MGRS), 
the construction of which is described in Section 3.1 and, in 
more detail, in Yang et al. (2004). 

Applying the group finder to these MGRSs yields the 
completeness and contamination shown in the left-hand 
panel of Fig. 7 (symbols with errorbars). The results are 
very similar to those for our volume limited mock samples 
shown in Fig. 2, indicating that our group finder works al- 
most equally well for flux limited samples as for volume lim- 
ited samples. For comparison, the dashed lines indicate the 
completeness and contamination obtained using the stan- 
dard FOF group finder of Eke et al. (2004a) , with the link- 
ing lengths given by eq. (12) and (13). Clearly, this group 
finder yields significantly more interlopers than our method, 
especially in small groups. The right-hand panel of Fig, 7 
compares the estimated group luminosity (obtained using 
our fiducial group finder) with the true group luminosity. 
Again the results are very similar to those obtained using 
volume limited samples (cf. Fig. 4) , and indicate that group 
luminosities can be obtained fairly robustly. The scatter in 
logL is about 0.1 in both directions, corresponding to an 
error of ~ 30% when the luminosity of a selected group is 
used to infer the true luminosity. 

As a final demonstration of the performance of our 
group finder, Fig. 8 plots the number of groups identified 
as a function of redshift. In the left-hand panel, we plot the 
redshift distribution of all identified groups (including those 
with only one member) with luminosity Lis > 10 10 ' 5 h~ 2 Lq 
(where Lis is a scaled group luminosity which will be defined 
later). Comparing these result with the solid line, which cor- 



responds to a constant number density, indicates that the 
group finder works remarkably uniformly over the entire red- 
shift range probed: the group completeness is virtually in- 
dependent of redshift. This success is partly due to the fact 
that our group finder can also identify systems that contain 
only one or two galaxies. If we impose a richness thresh- 
old, then the number of groups will decline with redshift at 
high z, as shown in the right-hand panel of Fig. 8. How- 
ever, the change with redshift of the number of the groups 
selected by our group finder matches that of the true groups 
remarkably well. In contrast, the FOF method of Eke et 
al. (2004a) yields far too many groups. For example, for 
groups with Lis > W 10 ' 6 h~ 2 Lo, our group finder selects 
7091 groups with richness N > 2, close to the true num- 
ber of 7040, while the FOF method selects 9665 groups. For 
N > 3, the true number of groups is 5231, while the number 
of groups selected are 4889 by our method and 6786 by the 
FOF method; for N > 4, the corresponding numbers are 
3820, 3567, and 4969. 

Although the completeness of the group identification 
is virtually redshift independent, the individual complete- 
ness of the identified groups does depend on redshift. Since 
in an apparent magnitude limited sample the mean num- 
ber density of galaxies decreases with increasing redshift, 
groups associated with similar halos will be richer at lower 
redshift. Caution is therefore required when using groups se- 
lected from an apparent-magnitude limited sample to study 
the intrinsic group properties. For example, if we consider 
groups with similar richness, we may in fact mix systems 
with different masses. One can circumvent this problem by 
using volume-limited samples to identify groups. This has 
the advantage that the identified groups are uniform in red- 
shift, but the disadvantage that it only makes limited use of 
the full observational data set. 

An alternative method, which we adopt in this paper, is 
to use the full apparent-magnitude limited sample to iden- 
tify galaxy groups. In this case, however, before comparing 
groups at different redshifts, one need to bring their intrinsic 
properties as determined from the detected member galaxies 
to a common scale. Here we focus on the group luminosity 
and investigate how to scale Lg roU p. One possibility is to 
compute Ltotai using eq. (4). In fact, many earlier analyses 
have used this approach to calculate the total luminosity or 
richness of the groups (Tucker 2000; Merchan & Zandivarez 
2002; Kochanek et al. 2003; Eke et al. 2004b). This method 
is based on the assumption that the galaxy luminosity func- 
tion in groups is similar to that of field galaxies. However, 
as shown in our earlier analyses, the galaxy luminosity func- 
tion in different halos (Yang, Mo & van den Bosch 2003; van 
den Bosch, Yang & Mo 2003) and different environments 
(Mo et al. 2004) may be very different. Therefore, it is not 
reliable to use eq (4) to estimate L tota i from Lg roU p. Here 
we suggest a more empirical approach. A nearby group se- 
lected in an apparent magnitude limited survey should con- 
tain all of its members down to a faint luminosity. We can 
therefore use these nearby groups to determine the relation 
between the group luminosity obtained using only galax- 
ies above a bright luminosity limit and that obtained using 
galaxies above a fainter luminosity limit. Assuming that this 
relation is redshift-independent, one can correct the lumi- 
nosity of a high-z group, where only the brightest members 
are observed, to an empirically normalised luminosity scale. 
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Figure 7. The left-hand panel shows the completeness (Ft) and contamination^;) of the groups selected from our MGRS. Symbols 
are as in Fig. 2, with the dashed lines indicating Ft and Fj obtained when using the traditional FOF method with the linking lengths 
of eq. (12) and (13). Note how this method yields much larger interloper fractions than our group finder, especially in low-mass halos. 
The right-hand panel plots the group luminosity, F groU p, for groups with N > 2 obtained using our fiducial group finder, versus the true 
group luminosity Ftruc (cf. Fig 4). 
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Figure 8. The number of groups detected in our MGRS as function of redshift. In the left-hand panel we plot results for all groups with 
N > 1 and Lis > 10 10 ' 5 /i — 2 Lq, with the open circles indicating the mean number of groups identified using our fiducial group finder. 
The errorbars indicate the l-cr variance obtained from all 8 MGRSs. The solid line indicates the expected relation for a constant group 
number density (i.e., for a group detection completeness that is redshift independent), and is shown for comparison. In the right-hand 
panel we show the same N gr - oup (z) but this time only for groups with N > 2. The open circles indicate the true N eIoup (z), with the solid 
and dashed lines indicating the numbers of groups detected using our fiducial group finder and the traditional FOF method, respectively. 



As common luminosity scale we use Lig, defined as the 
luminosity of all group members brighter than Mij = —18 + 
5 log h. To calibrate the relation between L gr0 up and Lis we 
first select all groups with z < 0.09, which corresponds to 
the redshift for which a galaxy with Mt,j = —18 + 5 log h has 
an apparent magnitude that is equal to the mean limit of the 



2dFGRS (bj < 19.3). Fig. 9 plots the ratio L ia /L group ob- 
tained from these groups, where L gr ou P is now defined as the 
luminosity of all group members brighter than a given limit. 
Different symbols correspond to different luminosity limits, 
as indicated in the right-hand panel. The left and right-hand 
panels correspond to our MGRS and the 2dFGRS, respec- 
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. Here Lis is the total luminosity of all group galaxies brighter than M;, — 5 log h = 

— 18.0, while L gr0 up is defined as the total luminosity of all group galaxies with Mf,j —5 log h < —20.0 (open circles), —19.5 (filled squares), 

— 19.0 (open squares), and —18.5 (filled triangles). The errorbars indicate 1-cr scatter of the ratios within different group luminosity bins, 
while the solid lines are our fits to these ratios, used to compute Lis from an observed L gr0 up (see text for details). Results in the panels 
on the left and right correspond to groups identified in our MGRS and the 2dFGRS, respectively. 



tively. It is clear that, for a variety of luminosity limits, the 
relation is quite tight. 

In what follows we proceed with two different ways. For 
all selected groups with z < 0.09 we compute Lis directly 
from the selected members with M^j < — 18 + 5 log ft. For 
groups at higher redshifts, we compute L SIOup and use the 
appropriate, average relation between Lig and Lgroup to es- 
timate the former. 

Fig. 10 plots Lis thus obtained from the groups in our 
MGRS as function of group halo mass. Clearly, Lig is quite 
tightly correlated with the halo mass in the model, and so 
the value of Lis can be used to divide groups according to 
their halo masses. For comparison, we also plot the theoret- 
ical prediction of the model group Lis - halo mass relation. 
The excellent match between the model prediction and the 
mean value confirms the accuracy of our group Lis estima- 
tion. 

The velocity dispersion of galaxies in a group is another 
mass indicator that can be estimated directly. As shown in 
Section 3.2, the velocity dispersion obtained directly from 
the identified members (true plus interlopers) may differ sys- 
tematically from the true velocity dispersion (obtained from 
all true members). Moreover, in the 2dFGRS, the measure- 
ment error in redshift is typically about 85 kms -1 , which 
is quite big in comparison with the typical velocity disper- 
sion of small groups. Therefore, the masses estimated from 
velocity dispersions are not expected to be accurate. In the 
upper two panels of Fig. 11, we show the relation between 
the line-of-sight velocity dispersion of a group (estimated us- 
ing equation 16) and the mass of the host halo for systems 
which contain at least 3 members. The left panel shows the 
result where all true member galaxies in a halo are used to 
estimate the velocity dispersion. Even in this case, the scat- 




log[L 18 /h-%] 

Figure 10. The luminosity Lis (see text for definition) of groups 
selected from the MGRS versus the associated halo mass. Open 
circles with errorbars indicate the mean and 1-a variance of the 
distribution of halo mass for groups with constant Lis, while the 
solid line corresponds to the prediction computed from the CLF. 

ter is quite large, although there is no noticeable deviation 
of the mean from the expected relation (solid lines). The 
right panel shows the result where all selected members (se- 
lected true members plus interlopers) are used. In this case, 
the presence of interlopers contaminates the mass-velocity 
dispersion relation, and the velocity dispersion is slightly 
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Figure 11. The left panels show the relation between the line-of-sight velocity dispersion of (all) true members versus the halo mass in 
an apparent magnitude limited sample. The right panels show that for groups with identified members (true plus interloper). The upper 
and lower panels correspond to results of groups with richness at least 3 and 8, respectively. The symbols and errorbars are the mean 
and 1-cr scatter of the line-of-sight velocity dispersion in each mass bin. The solid line is the mass-velocity relation of the dark matter 
particles within halos and M oc a 3 . The dotted lines in the two upper panels roughly correspond to the completeness limit of the groups 
with members at least 3. 



under-estimated with respect to the expected value. In the 
lower panels of Fig. 11, we plot the same relations, but for 
groups with at least 8 members. Here the scatter in the rela- 
tion is greatly reduced. In the case where all true members 
are used, the velocity dispersion of galaxies in a group is a 
good indicator of mass (see the lower left panel), but bias 
still exists when using the selected group members (lower 
right-hand panel). This is due to the fact that member galax- 



ies with the highest velocities are the easiest to miss by the 
group finder. 



4 APPLICATIONS TO THE 2DFGRS 

Having calibrated and tested our group finder to both vol- 
ume limited mock samples and to flux limited mock galaxy 
redshift surveys, we now apply it to the 2dFGRS. We use 
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Figure 12. The richness (left panel) and group line-of-sight velocity dispersions (right panel) of the groups found in 2dFGRS observations 
and mock samples. The histograms are results of 2dFGRS observations, and different lines are results of different model mock samples. 
The errorbars are l-cr variances of 8 mock samples. Obviously, there are more rich systems in the fiducial mock 2dFGRS samples than 
in the observations. As a comparison, the dotted line in the right panel shows the velocity dispersion of dark matter particles within the 
host halos. 



the final public data release, which contains about 250, 000 
galaxies with redshifts and is complete to an extinction- 
corrected apparent magnitude of bj « 19.45. The survey 
covers an area of ~ 1500 square degrees selected from the 
extended APM Survey (Maddox et al. 1996). The survey 
geometry consists of two separate declination strips in the 
North Galactic Pole (NGP) and the South Galactic Pole 
(SGP), respectively, together with 100 2-degree fields spread 
randomly in the southern Galactic hemisphere. 

In what follows we restrict ourselves only to galaxies 
with redshifts 0.01 < z < 0.20 in the NGP and SGP subsam- 
ples with a redshift quality parameter q > 3 and a redshift 
completeness > 0.8. This leaves a grand total of 151,280 
galaxies with a typical rms redshift error of 85 kms -1 
(Colless et al. 2001). Absolute magnitudes for galaxies in 
the 2dFGRS are computed using the type-dependent K- 
correction of Madgwick et al. (2002). For those galaxies 
without a reliable estimate of r\ we adopt the average K- 
correction (see Madgwick et al. 2002 for details). We use ex- 
actly the same restrictions when selecting galaxies from each 
of our eight MGRSs, yielding samples with 152, 000 ± 4, 000 
mock galaxies. 

4.1 Groups in the 2dFGRS 

Applying our fiducial group finder to our 2dFGRS sample 
yields a group catalogue containing 78708 systems. Among 
these are 7251 binaries, 2343 triplets, and 2502 groups with 
four members or more. Using the FOF group finder, Eke 
et al. (2004a) identified 7020 groups with richness N > 4. 
Therefore, our group finder seems to give a much smaller 
number of N > 4 systems than does the FOF method 
adopted by Eke et al. There are two reasons for this dis- 
crepancy. First of all, as discussed in Section 3.3, the FOF 



method tends to select too many groups at high z ^ 0.1. 
Secondly, our sample is about 25% smaller than theirs, be- 
cause we have imposed a more strict cut in sky coverage to 
get rid of regions where observational incompleteness is rel- 
atively high. When applying the FOF group finder adopted 
by Eke et al. to our 2dFGRS sample, we find 5749 groups 
with richness N > 4, which is consistent with the 25% differ- 
ence in sample size. Taking these two effects into account, 
the number of groups we obtain is roughly comparable to 
that obtained by Eke et al. Of course, since we use a differ- 
ent group finder, the systems we select are expected to be 
different from theirs, and so we do not expect to have an 
exact match in group number. 

The histograms in Fig. 12 show the number of groups 
selected from the 2dFGRS as a function of richness (left 
panel) and line-of-sight velocity dispersion (right panel). In 
the left-hand panel we only show results for groups with 
luminosity Lis > 10 10 ' 5 h~ 2 Lq, which approximately corre- 
sponds to the completeness limit of the group luminosity 
function (see Fig 6), while the right-hand panel only shows 
results for groups with N > 3. The various curves in both 
panels correspond to model predictions obtained from var- 
ious mock catalogues, which will be discussed in the next 
subsection. As a comparison, we plot in the right panel of 
Fig. 12 the velocity dispersion function (which is equivalent 
to the mass function) of the dark matter particles within 
the mock halos. There is significant difference between the 
group velocity dispersion function and the halo velocity dis- 
persion function, which is due to the bias already shown in 
Fig 11 and discussed in Section 3.3. 

The histograms in Fig. 13 show the luminosity functions 
of the 2dFGRS groups. The upper left panel is the result for 
the full magnitude-limited sample. In this case, the observed 
group luminosities are corrected to Lis using the relations 
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Figure 13. Similar to Fig. 6, but for 2dFGRS observations and mock samples. The histograms are results of 2dFGRS observations, 
and different lines are results of different model mock samples. The errorbars are 1-cr variances of 8 mock samples. The upper left panel 
shows the group luminosity function for magnitude limited 2dF samples. The upper right, lower right and lower left panels are the group 
luminosity functions for volume limited 2dF samples with absolute magnitude cut, . — 5 log h < —18.0, —19.0 and —20.0, respectively. 
Again, wc find more bright (rich) groups in the fiducial mock 2dFGRS samples than in the observations. To test the impact of satellite 
galaxy distribution in individual halos, we also plot in the upper left panel the results for the MGRSs that assume velocity bias (thin 
solid line), spatial bias (thin dotted line), and scatter in the M/L (thin long dashed line). Details about these tests are described in the 
text. 



given in Fig. 9. The other three panels are the results for 
volume-limited samples. Comparing the group luminosity 
functions (GLFs) for the different volume-limited samples, 
we see that groups with more than 3 members are complete 
to a luminosity Lg roU p ~ 10 11 hr" 1 Lq for the volume-limited 
sample with M bj < —20.0 + 51ogft, and to a luminosity 
£grou P ~ lO lo ' 5 /!r 2 L for the volume-limited sample with 
M b j < -18.0 + 51og/i. 



4.2 Comparison between model and observation 

The results obtained above for the 2dFGRS groups can 
be compared with model predictions to constrain theories 
of structure formation. Since our MGRSs have been con- 
structed to mimick the 2dFGRS in detail, and since we have 
applied exactly the same selection criteria as for the 2dF- 
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GRS, we can make such a comparison in a straightforward 
way. 

The solid curves in Figs. 12 and 13 show the predic- 
tions obtained from our MGRSs. The errorbars indicate the 
scatter from among our eight independent MGRSs, and il- 
lustrate the expected scatter due to cosmic variance. There 
is a pronounced discrepancy between these MGRSs and the 
2dFGRS: the model predicts too many rich systems that 
have high velocity dispersions and high luminosities. This 
is consistent with Yang et al. (2004), who found that the 
same mock galaxy redshift surveys predict too high ampli- 
tudes for the pairwise peculiar velocity dispersion and for the 
real space two-point correlation function on small scales. As 
shown in Yang et al. , these discrepancies can be attributed 
to the fact that this model predicts too many galaxies in 
massive clusters. 

Before preceeding to seek potential solutions to the 
problem, we first test whether the assumption of satellite 
galaxy distribution in individual halos have signifiant impact 
on our results. Firstly, we consider a case in which satellite 
galaxies have a velocity bias relative tot he dark matter. As 
in Yang et al. (2004), we assume the velocity bias parameter 
b = crg a i/(7DM — 0-6- As an example, we show the result of 
the group luminosity function given by this model as the 
thin solid line in the upper left panel of Fig. 13. Secondly, 
we consider a case where the distribution of satellite galaxies 
in a halo is assumed to be shallower than that of the dark 
matter. Note that this assumption is in fact consistent with 
the result obtained by van den Bosch et al. (2004b). As a 
simple model, we assume the number density distribution of 
satellites in a halo to be 

^M^ro^r 3 ' (i7) 

with a = 0.0 and TZ = 2.0. The group luminosity function 
predicted by this model is plotted as the thin dotted line in 
the upper left panel of Fig. 13. Thirdly, we consider a case 
where we introduce 20% scatter around the mean M/L pre- 
dicted by the CLF when populating dark matter halos with 
galaxies. Note that even without this scatter, there is scatter 
in the M/L among halos of the same mass, because satel- 
lite galaxies are assigned to individual halos in a stochatical 
way (see Yang et al. 2004). The group luminsity function 
predicted by this model galaxies is shown as the thin long 
dashed line in the upper left panel of Fig. 13. Obviously, 
all these changes have only negligible impact on our final 
results. We can therefore rule out the possibility that the 
discrepancy between our fiducial model and observation is 
due to our assumption about the distribution of satellite 
galaxies in individual halos. 

Next, we seek potential solutions to the discrepancy be- 
tween our MGRSs and the 2dFGRS. The CLF used to con- 
struct these MGRSs was constrained to yield an average 
mass-to-light ratio for clusters of {M/L) c \ — 500/i (M/L)©. 
Either the true mass-to-light ratios of clusters are signifi- 
cantly higher (i.e., fewer galaxies per cluster), or there are 
fewer massive halos, which implies a reduction of the power- 
spectrum normalisation as- Yang ct al. (2004) were able to 
reproduce the clustering properties of 2dFGRS galaxies if 
either the cluster mass-to-light ratio is increased to about 
(M/L) cl = 900/i (M/L) , or <j 8 is lowered to about 0.7 
(compared to 0.9 in the concordance cosmology adopted 



so far). The dashed and dot-dashed curves in Figs. 12 
and 13 show the group properties obtained using MGRSs 
with (M/L)d = 900fr (M/L)© and a s = 0.7§, respectively. 
Both very nicely match the multiplicity function, the veloc- 
ity dispersion function, and the luminosity function of the 
2dFGRS groups, thus strengthening the conclusions reached 
by Yang et al. (2004) that either clusters have high mass- 
to-light ratios, which seems incompatible with a number of 
independent observational constraints (e.g., Carlberg et al. 
1996; Fukugita et al 1998; Bahcall 2000), or that the power- 
spectrum normalisation is significantly lower than typically 
assumed. As discussed in van den Bosch, Mo & Yang (2003) 
this latter option cannot be ruled out by current observa- 
tions. 



5 CONCLUSIONS 

The current CDM scenario, in which galaxies are assumed 
to form in dark matter halos, is extremely successful in ex- 
plaining a large range of observational data. Motivated by 
this success, and aided by our detailed knowledge of the 
abundances and properties of dark matter halos within this 
CDM scenario, we have developed a group finder that can 
successfully group galaxies in redshift surveys according to 
their common halos. Using detailed mock catalogues, con- 
structed using large numerical simulations combined with 
the conditional luminosity function of galaxies, we care- 
fully tested the performance of this group finder. Individ- 
ual groups selected using our group finder have an average 
completeness of about 90 percent and with only ~ 20 per- 
cent interlopers. The group luminosities agree with the true 
values to better than 70% level, and the overall group lu- 
minosity function matches the real one well for groups with 

-^group 

<; 10 105 h Lq. 

We have applied our group finder to the 2dFGRS and 
compared the properties of the 2dF groups with those ex- 
tracted from detailed mock galaxy redshift surveys. Al- 
though the 2dF groups have similar properties as the mock 
groups, wc find a clear discrepancy between mock and 2dF 
groups, in the sense that the model predicts too many rich 
systems. In order to match the observational results, we have 
to either increase the mass-to-light ratios of rich clusters 
to a level significanly higher than the typical observational 
value, or assume that as — 0.7 compared to the 'concor- 
dance' value of 0.9. This result is in perfect agreement with 
our previous findings based on the redshift-space clustering 
of galaxies (Yang et al. 2004), and enforces the conclusion 
that the concordance ACDM model may have too high clus- 
tering power on small scales. 

The groups identified by our group finder are closely 
related to the underlying dark matter halos. Given a uni- 
form group catalogue constructed in this way, one can do 
many interesting things that can provide pivotal informa- 
tion about how galaxies form and evolve in CDM halos. We 
plan to come back to some of the problems in forthcoming 
papers. 

§ As in Yang ct al. (2004), the model results for ag = 0.7 arc 
not based on new simulations, but on the re-scaling of the halo 
number density from the simulations of the as = 0.9 model. 



© 0000 RAS, MNRAS 000, 000-000 



18 Yang, Mo, van den Bosch & Jing 



ACKNOWLEDGEMENT 

Numerical simulations used in this paper were carried out 
at the Astronomical Data Analysis Center (ADAC) of the 
National Astronomical Observatory, Japan. We thank the 
2dF team for making their data publicly available. 



REFERENCES 

Bahcall N.A., Ccn R., Dave R., Ostriker J. P., Yu Q., 2000, ApJ, 
541, 1 

Bahcall N.A. et al. , 2003, ApJS, 148, 243 

Beers T.C., Flynn K., Gebhardt K., 1990, AJ, 100, 32 

Berlind A.A., Weinberg D.H., 2002, ApJ, 575 587 

Berlind A. A., Weinberg D.H., Benson A. J., Baugh CM., Cole S., 

Dave R., Frenk C.S., Jenkins A., Katz N., Laccy C.G., 2003, 

ApJ, 593, 1 

Brainerd T.G., Spccian M.A., 2003, ApJ, 593, L7 
Bryan G., Norman M., 1998, ApJ, 495, 80 

Bullock J.S., Kolatt T.S., Sigad Y., Somerville R.S., Klypin A. A., 
Primack J.R., Dekel A., 2001, MNRAS, 321, 559 

Bullock J.S., Wechsler, R.H., Somerville R.S., 2002, MNRAS, 329, 
246 

Carlberg R.G., Yee H.K.C., Ellingson E., Abraham R., Gravel P., 

Morris S., Pritchet C.J., 1996, ApJ, 462, 32 
Carlberg R.G., et al., 1997, ApJ, 485, L13 

Cole S., Aragon-Salamanca A., Frenk C.S., Navarro J.F., Zepf 

S.E., 1994, MNRAS, 271, 781 
Cole S., Lacey C.G., Baugh CM., Frenk C.S., 2000, MNRAS, 

319, 168 

Colless M., et al., 2001, MNRAS, 328, 1039 

Diaferio A., Kauffmann G, Colberg J.M., White S.D.M., 1999, 

MNRAS, 307, 537 
Diemand J., Moore B., Stadel J., 2004, MNRAS, 352, 535 
Eke V.R., et al. 2004a, MNRAS, 348, 866 
Eke V.R., et al. 2004b, astro-ph/0402566 
Fukugita M., Hogan C.J., Peebles P.J.E., 1998, ApJ, 503, 518 
Geller M.J., Huchra J. P., 1983, ApJS, 52, 61 
Jing Y.P., 2002, MNRAS, 335L, 89 
Jing Y.P., Borncr C, Suto Y., 2002, ApJ, 564, 15 
Jing Y.P., Suto Y., 2002, ApJ, 574, 538 
Jing Y.P., Mo H.J., Borner C, 1998, ApJ, 494, 1 
Kang X., Jing Y.P., Mo H.J., Borncr C, 2002, MNRAS, 336, 892 
Katz N., Weinberg D.H., Hernquist L., 1996, ApJS, 105, 19 
Kepner J., Fan X., Bahcall N., Gunn J., Lupton R., Xu G, 1999, 

ApJ, 517, 78 

Kochanek C.S., White M., Huchra J., Macri L., Jarrett T.H., 
Schneider S.E., Mader J., 2003, ApJ, 585, 161 

Kauffmann C, Colberg J.M., Diaferio A., White S.D.M., 1999, 
MNRAS, 303, 188 

Kim R.J.S., et al. 2002, AJ, 123, 20 

Lee B.C., et al., 2004, AJ, 127, 1811 

Lin Y.-T., Mohr J. J., Stanford S.A., 2004, ApJ, 610, 745 
Maddox S.J., Efstathiou G, Sutherland W.J., 1996, MNRAS, 
283, 1227 

Madgwick D.S. et al., 2002, MNRAS, 333, 133 
Magliocchctti M., Porciani C, 2003, MNRAS, 346, 186 
Marinoni C, Hudson M.J., 2002, ApJ, 569, 101 
McKay T.A., ct al., 2002, ApJ, 571, L85 
Merchan M., Zandivarez A., 2002, MNRAS, 335, 216 
Mo H.J., Yang X., van den Bosch, F.C, Jing Y.P., 2004, MNRAS, 
349, 205 

Navarro J.F., Frenk C.S., White S.D.M., 1997, ApJ, 490, 493 
Norbcrg P., ct al., 2002, MNRAS, 336, 907 
Peacock J. A., Smith R.E., 2000, MNRAS, 318, 1144 
Pearce F.R., Thomas P.A., Couchman H.M.P., Edge A.C, 2000, 
MNRAS, 317, 1029 



Postman M., et al., 1996, AJ, 111, 615 
Prada F., ct al., 2003, ApJ, 598 260 

Rines K., Geller M.J., Diaferio A., Kurtz M.J., Jarrett T.H., 2004, 

preprint (astro-ph/0402242) 
Scljak U., 2000, MNRAS, 318, 203 

Scoccimarro R., Sheth R.K., Hui L., Jain B., 2001, ApJ, 546, 20 

Scranton R., 2002, MNRAS, 332, 697 

Sheth R.K., Mo H.J., Tormen C, 2001, MNRAS, 323, 1 

Sheth R.K., Tormen C, 1999, MNRAS, 308, 119 

Somerville R.S., Primack J.R., 1999, MNRAS, 310, 1087 

Spergel D.N., et al., 2003, ApJS, 148, 175 

Tucker D.L., et al., 2000, ApJS, 130, 237 

van den Bosch F.C, Yang X., Mo H.J., 2003, MNRAS, 340, 771 
(Paper II) 

van den Bosch F.C, Mo H.J., Yang X., 2003, MNRAS, 345, 923 
van den Bosch F.C, Norberg P., Mo H.J., Yang X., 2004a, MN- 
RAS, in press, preprint (astro-ph/0404033) 
van den Bosch F.C, Yang X., Mo H.J., Norberg P., 2004b, 

preprint (astro-ph/0406246) 
van der Marcl R.P., Magorrian J., Carlberg R.G., Yee H.K.C, 

Ellingson E., 2000, AJ, 119, 2038 
Wang Y. Yang X., Mo H.J., van den Bosch F.C, Chu Y.Q., 2004, 

MNRAS, in press, preprint (astro-ph/0404143) 
White M., 2001, MNRAS, 321, 1 
White M., Kochanek C.S., 2002, ApJ, 574, 24 
Yan R., Madgcick D.S., White M., 2003, ApJ, 598, 848 
Yan R., White M., Coil A.L., 2004, ApJ, 607, 739 
Yang X., Mo H.J., van den Bosch F.C, 2003, MNRAS, 339, 1057 
(Paper I) 

Yang X., Mo H.J., Jing Y.P., van den Bosch F.C, Chu Y.Q., 

2004, MNRAS, 350, 1153 
Yoshikawa K., Jing Y.P., Borncr G, 2003, ApJ, 590, 654 
Zheng Z., Tinker J.L., Weinberg D.H., Berlind A.A., 2002, ApJ, 

575, 617 



© 0000 RAS, MNRAS 000, 000-000 



